Overview

Dataset statistics

Number of variables24
Number of observations13644
Missing cells67159
Missing cells (%)20.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 MiB
Average record size in memory241.0 B

Variable types

NUM12
CAT11
BOOL1

Reproduction

Analysis started2020-06-03 05:46:12.258482
Analysis finished2020-06-03 05:47:18.854152
Duration1 minute and 6.6 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

state has constant value "WA" Constant
propertyName has a high cardinality: 697 distinct values High cardinality
street has a high cardinality: 697 distinct values High cardinality
neighborhood has a high cardinality: 87 distinct values High cardinality
unit has a high cardinality: 1726 distinct values High cardinality
name has a high cardinality: 4476 distinct values High cardinality
rent has 4132 (30.3%) missing values Missing
sqft has 213 (1.6%) missing values Missing
deposit has 13452 (98.6%) missing values Missing
unit has 9113 (66.8%) missing values Missing
leaseLength has 13480 (98.8%) missing values Missing
name has 335 (2.5%) missing values Missing
new has 13121 (96.2%) missing values Missing
applyNow has 13309 (97.5%) missing values Missing
Unnamed: 0_x has 178 (1.3%) zeros Zeros
reviewScore has 9224 (67.6%) zeros Zeros
reviewCount has 5516 (40.4%) zeros Zeros
transitScore has 212 (1.6%) zeros Zeros

Variables

Unnamed: 0_x
Real number (ℝ≥0)

ZEROS

Distinct count752
Unique (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean239.02059513339196
Minimum0
Maximum751
Zeros178
Zeros (%)1.3%
Memory size106.6 KiB

Quantile statistics

Minimum0
5-th percentile12
Q158
median161
Q3365.25
95-th percentile711
Maximum751
Range751
Interquartile range (IQR)307.25

Descriptive statistics

Standard deviation223.0319473
Coefficient of variation (CV)0.9331076562
Kurtosis-0.2335259764
Mean239.0205951
Median Absolute Deviation (MAD)119
Skewness0.982545378
Sum3261197
Variance49743.24952
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
672201.6%
 
7342201.6%
 
612001.5%
 
6992001.5%
 
01781.3%
 
3621781.3%
 
571581.2%
 
6761581.2%
 
61571.2%
 
631481.1%
 
Other values (742)1182786.7%
 
ValueCountFrequency (%) 
01781.3%
 
1400.3%
 
2680.5%
 
3260.2%
 
4540.4%
 
ValueCountFrequency (%) 
75180.1%
 
750480.4%
 
7496< 0.1%
 
74880.1%
 
747120.1%
 

propertyName
Categorical

HIGH CARDINALITY

Distinct count697
Unique (%)5.1%
Missing0
Missing (%)0.0%
Memory size106.6 KiB
Batik
 
440
Alexan 100
 
400
Jackson Apartments
 
356
624 Yale
 
316
Sedona Apartments
 
296
Other values (692)
11836
ValueCountFrequency (%) 
Batik4403.2%
 
Alexan 1004002.9%
 
Jackson Apartments3562.6%
 
624 Yale3162.3%
 
Sedona Apartments2962.2%
 
Assembly1182882.1%
 
Met Tower2722.0%
 
STAZIONE252641.9%
 
Saxton2521.8%
 
McKenzie1761.3%
 
Other values (687)1058477.6%
 

Length

Max length45
Median length11
Mean length12.62943418
Min length2

street
Categorical

HIGH CARDINALITY

Distinct count697
Unique (%)5.1%
Missing0
Missing (%)0.0%
Memory size106.6 KiB
123 Broadway
 
440
100 Denny Way
 
400
2401 S Jackson St
 
356
624 Yale Ave N
 
316
8500 20th Ave NE
 
296
Other values (692)
11836
ValueCountFrequency (%) 
123 Broadway4403.2%
 
100 Denny Way4002.9%
 
2401 S Jackson St3562.6%
 
624 Yale Ave N3162.3%
 
8500 20th Ave NE2962.2%
 
4200 S Othello St2882.1%
 
1942-1942 Westlake Ave2722.0%
 
2615 25th Ave S2641.9%
 
520 Terry Ave2521.8%
 
2202 Eighth Ave1761.3%
 
Other values (687)1058477.6%
 

Length

Max length38
Median length15
Mean length15.57248607
Min length10

city
Categorical

Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size106.6 KiB
Seattle
13603
Lake Forest Park
 
26
Burien
 
11
Shoreline
 
4
ValueCountFrequency (%) 
Seattle1360399.7%
 
Lake Forest Park260.2%
 
Burien110.1%
 
Shoreline4< 0.1%
 

Length

Max length16
Median length7
Mean length7.016930519
Min length6

state
Categorical

CONSTANT
REJECTED

Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size106.6 KiB
WA
13644
ValueCountFrequency (%) 
WA13644100.0%
 

Length

Max length2
Median length2
Mean length2
Min length2

zipCode
Real number (ℝ≥0)

Distinct count32
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98116.88199941366
Minimum98101
Maximum98199
Zeros0
Zeros (%)0.0%
Memory size106.6 KiB

Quantile statistics

Minimum98101
5-th percentile98101
Q198108
median98115
Q398122
95-th percentile98144
Maximum98199
Range98
Interquartile range (IQR)14

Descriptive statistics

Standard deviation14.62209843
Coefficient of variation (CV)0.0001490273451
Kurtosis8.608362182
Mean98116.882
Median Absolute Deviation (MAD)7
Skewness2.369625893
Sum1338706738
Variance213.8057626
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
98109248218.2%
 
98122147110.8%
 
98121146310.7%
 
9810110217.5%
 
981159547.0%
 
981447375.4%
 
981167185.3%
 
981047055.2%
 
981257055.2%
 
981075183.8%
 
Other values (22)287021.0%
 
ValueCountFrequency (%) 
9810110217.5%
 
981021591.2%
 
981035053.7%
 
981047055.2%
 
981053442.5%
 
ValueCountFrequency (%) 
98199580.4%
 
98198150.1%
 
98188850.6%
 
98178250.2%
 
981771< 0.1%
 

neighborhood
Categorical

HIGH CARDINALITY

Distinct count87
Unique (%)0.6%
Missing4
Missing (%)< 0.1%
Memory size106.6 KiB
South Lake Union
 
1611
First Hill
 
1074
Belltown
 
993
Denny Triangle
 
956
Lower Queen Anne
 
840
Other values (82)
8166
ValueCountFrequency (%) 
South Lake Union161111.8%
 
First Hill10747.9%
 
Belltown9937.3%
 
Denny Triangle9567.0%
 
Lower Queen Anne8406.2%
 
Capitol Hill7375.4%
 
Ballard4903.6%
 
Atlantic4253.1%
 
Brighton3562.6%
 
Roosevelt3502.6%
 
Other values (77)580842.6%
 

Length

Max length31
Median length10
Mean length11.57292583
Min length3

reviewScore
Real number (ℝ≥0)

ZEROS

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5583406625622984
Minimum0
Maximum5
Zeros9224
Zeros (%)67.6%
Memory size106.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.264788044
Coefficient of variation (CV)1.453333086
Kurtosis-1.351812219
Mean1.558340663
Median Absolute Deviation (MAD)0
Skewness0.7846450508
Sum21262
Variance5.129264885
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0922467.6%
 
5364126.7%
 
47385.4%
 
3230.2%
 
2180.1%
 
ValueCountFrequency (%) 
0922467.6%
 
2180.1%
 
3230.2%
 
47385.4%
 
5364126.7%
 
ValueCountFrequency (%) 
5364126.7%
 
47385.4%
 
3230.2%
 
2180.1%
 
0922467.6%
 

reviewCount
Real number (ℝ≥0)

ZEROS

Distinct count24
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.4695836997947818
Minimum0
Maximum150
Zeros5516
Zeros (%)40.4%
Memory size106.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q34
95-th percentile15
Maximum150
Range150
Interquartile range (IQR)4

Descriptive statistics

Standard deviation7.751634553
Coefficient of variation (CV)2.234168484
Kurtosis139.4384503
Mean3.4695837
Median Absolute Deviation (MAD)1
Skewness9.025237986
Sum47339
Variance60.08783825
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0551640.4%
 
1220116.1%
 
211658.5%
 
59416.9%
 
38846.5%
 
47205.3%
 
63362.5%
 
82842.1%
 
152611.9%
 
231991.5%
 
Other values (14)11378.3%
 
ValueCountFrequency (%) 
0551640.4%
 
1220116.1%
 
211658.5%
 
38846.5%
 
47205.3%
 
ValueCountFrequency (%) 
150140.1%
 
53830.6%
 
24840.6%
 
231991.5%
 
22130.1%
 

walkScore
Real number (ℝ≥0)

Distinct count75
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean90.7982996188801
Minimum9
Maximum100
Zeros0
Zeros (%)0.0%
Memory size106.6 KiB

Quantile statistics

Minimum9
5-th percentile69
Q190
median96
Q398
95-th percentile99
Maximum100
Range91
Interquartile range (IQR)8

Descriptive statistics

Standard deviation12.17441151
Coefficient of variation (CV)0.1340819328
Kurtosis7.132671395
Mean90.79829962
Median Absolute Deviation (MAD)3
Skewness-2.507152451
Sum1238852
Variance148.2162956
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
99228516.7%
 
98143810.5%
 
97136610.0%
 
9611068.1%
 
957875.8%
 
926955.1%
 
906815.0%
 
1006724.9%
 
935804.3%
 
893892.9%
 
Other values (65)364526.7%
 
ValueCountFrequency (%) 
91< 0.1%
 
101< 0.1%
 
161< 0.1%
 
191< 0.1%
 
206< 0.1%
 
ValueCountFrequency (%) 
1006724.9%
 
99228516.7%
 
98143810.5%
 
97136610.0%
 
9611068.1%
 

transitScore
Real number (ℝ≥0)

ZEROS

Distinct count67
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean77.04954558780416
Minimum0
Maximum100
Zeros212
Zeros (%)1.6%
Memory size106.6 KiB

Quantile statistics

Minimum0
5-th percentile50
Q159
median82
Q3100
95-th percentile100
Maximum100
Range100
Interquartile range (IQR)41

Descriptive statistics

Standard deviation20.85972055
Coefficient of variation (CV)0.2707312599
Kurtosis0.8524035019
Mean77.04954559
Median Absolute Deviation (MAD)18
Skewness-0.7662133003
Sum1051264
Variance435.1279416
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
100392328.8%
 
547585.6%
 
826024.4%
 
595414.0%
 
664743.5%
 
564423.2%
 
704373.2%
 
524073.0%
 
644053.0%
 
843852.8%
 
Other values (57)527038.6%
 
ValueCountFrequency (%) 
02121.6%
 
334< 0.1%
 
341< 0.1%
 
3580.1%
 
361< 0.1%
 
ValueCountFrequency (%) 
100392328.8%
 
99730.5%
 
9890.1%
 
97990.7%
 
96720.5%
 

Unnamed: 0_y
Real number (ℝ≥0)

Distinct count10762
Unique (%)78.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5480.078789211375
Minimum0
Maximum10761
Zeros2
Zeros (%)< 0.1%
Memory size106.6 KiB

Quantile statistics

Minimum0
5-th percentile462
Q12532
median5383.5
Q38705.25
95-th percentile10408.85
Maximum10761
Range10761
Interquartile range (IQR)6173.25

Descriptive statistics

Standard deviation3352.311984
Coefficient of variation (CV)0.6117269684
Kurtosis-1.368434903
Mean5480.078789
Median Absolute Deviation (MAD)3056.5
Skewness0.0159681242
Sum74770195
Variance11237995.64
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
02< 0.1%
 
105472< 0.1%
 
105872< 0.1%
 
88242< 0.1%
 
26402< 0.1%
 
26852< 0.1%
 
105792< 0.1%
 
96062< 0.1%
 
88322< 0.1%
 
26482< 0.1%
 
Other values (10752)1362499.9%
 
ValueCountFrequency (%) 
02< 0.1%
 
12< 0.1%
 
22< 0.1%
 
32< 0.1%
 
42< 0.1%
 
ValueCountFrequency (%) 
107612< 0.1%
 
107602< 0.1%
 
107592< 0.1%
 
107582< 0.1%
 
107572< 0.1%
 

bedRoomType
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size106.6 KiB
Traditional
10842
Studio
2802
ValueCountFrequency (%) 
Traditional1084279.5%
 
Studio280220.5%
 

Length

Max length11
Median length11
Mean length9.973175022
Min length6

bedRoomNumber
Real number (ℝ≥0)

Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.2837144532395193
Minimum1
Maximum6
Zeros0
Zeros (%)0.0%
Memory size106.6 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile2
Maximum6
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5061189284
Coefficient of variation (CV)0.3942612994
Kurtosis4.173943486
Mean1.283714453
Median Absolute Deviation (MAD)0
Skewness1.786197531
Sum17515
Variance0.2561563697
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11008173.9%
 
2329524.1%
 
32391.8%
 
4200.1%
 
570.1%
 
62< 0.1%
 
ValueCountFrequency (%) 
11008173.9%
 
2329524.1%
 
32391.8%
 
4200.1%
 
570.1%
 
ValueCountFrequency (%) 
62< 0.1%
 
570.1%
 
4200.1%
 
32391.8%
 
2329524.1%
 

baths
Real number (ℝ≥0)

Distinct count12
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.227792436235708
Minimum0.0
Maximum5.0
Zeros17
Zeros (%)0.1%
Memory size106.6 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q31
95-th percentile2
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4417549202
Coefficient of variation (CV)0.3597960919
Kurtosis2.331313893
Mean1.227792436
Median Absolute Deviation (MAD)0
Skewness1.660588025
Sum16752
Variance0.1951474095
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11051777.1%
 
2270119.8%
 
1.52431.8%
 
2.5810.6%
 
3530.4%
 
3.5250.2%
 
0170.1%
 
0.52< 0.1%
 
42< 0.1%
 
3.251< 0.1%
 
Other values (2)2< 0.1%
 
ValueCountFrequency (%) 
0170.1%
 
0.52< 0.1%
 
11051777.1%
 
1.52431.8%
 
2270119.8%
 
ValueCountFrequency (%) 
51< 0.1%
 
42< 0.1%
 
3.5250.2%
 
3.251< 0.1%
 
3530.4%
 

rent
Real number (ℝ≥0)

MISSING

Distinct count1846
Unique (%)19.4%
Missing4132
Missing (%)30.3%
Infinite0
Infinite (%)0.0%
Mean2363.2186711522286
Minimum625.0
Maximum15160.0
Zeros0
Zeros (%)0.0%
Memory size106.6 KiB

Quantile statistics

Minimum625
5-th percentile1207
Q11720
median2110
Q32703
95-th percentile4045.7
Maximum15160
Range14535
Interquartile range (IQR)983

Descriptive statistics

Standard deviation1155.945331
Coefficient of variation (CV)0.4891402328
Kurtosis25.94459392
Mean2363.218671
Median Absolute Deviation (MAD)460
Skewness3.825272383
Sum22478936
Variance1336209.608
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1350630.5%
 
1695520.4%
 
1895470.3%
 
1899470.3%
 
1900460.3%
 
2250450.3%
 
2095400.3%
 
2000400.3%
 
2700390.3%
 
1850390.3%
 
Other values (1836)905466.4%
 
(Missing)413230.3%
 
ValueCountFrequency (%) 
6251< 0.1%
 
6451< 0.1%
 
6501< 0.1%
 
7501< 0.1%
 
7631< 0.1%
 
ValueCountFrequency (%) 
151604< 0.1%
 
148604< 0.1%
 
131604< 0.1%
 
125001< 0.1%
 
116604< 0.1%
 

sqft
Real number (ℝ≥0)

MISSING

Distinct count1524
Unique (%)11.3%
Missing213
Missing (%)1.6%
Infinite0
Infinite (%)0.0%
Mean747.04910282183
Minimum90.0
Maximum4380.0
Zeros0
Zeros (%)0.0%
Memory size106.6 KiB

Quantile statistics

Minimum90
5-th percentile317
Q1567
median696
Q3904
95-th percentile1227
Maximum4380
Range4290
Interquartile range (IQR)337

Descriptive statistics

Standard deviation298.8450342
Coefficient of variation (CV)0.4000339911
Kurtosis6.579931398
Mean747.0491028
Median Absolute Deviation (MAD)164
Skewness1.506451907
Sum10033616.5
Variance89308.35446
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6001040.8%
 
670830.6%
 
1000710.5%
 
589700.5%
 
573640.5%
 
633590.4%
 
650590.4%
 
656570.4%
 
700560.4%
 
665550.4%
 
Other values (1514)1275393.5%
 
(Missing)2131.6%
 
ValueCountFrequency (%) 
902< 0.1%
 
145100.1%
 
1471< 0.1%
 
1494< 0.1%
 
1531< 0.1%
 
ValueCountFrequency (%) 
43801< 0.1%
 
33801< 0.1%
 
30002< 0.1%
 
29501< 0.1%
 
27201< 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size13.3 KiB
False
8028
True
5616
ValueCountFrequency (%) 
False802858.8%
 
True561641.2%
 

deposit
Real number (ℝ≥0)

MISSING

Distinct count70
Unique (%)36.5%
Missing13452
Missing (%)98.6%
Infinite0
Infinite (%)0.0%
Mean1567.1510416666667
Minimum100.0
Maximum6700.0
Zeros0
Zeros (%)0.0%
Memory size106.6 KiB

Quantile statistics

Minimum100
5-th percentile400
Q1887.5
median1287.5
Q32000
95-th percentile3632.75
Maximum6700
Range6600
Interquartile range (IQR)1112.5

Descriptive statistics

Standard deviation1043.523759
Coefficient of variation (CV)0.6658731235
Kurtosis3.339982309
Mean1567.151042
Median Absolute Deviation (MAD)537.5
Skewness1.550877642
Sum300893
Variance1088941.836
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000280.2%
 
1500160.1%
 
500110.1%
 
120090.1%
 
200080.1%
 
75080.1%
 
80070.1%
 
4006< 0.1%
 
14005< 0.1%
 
3504< 0.1%
 
Other values (60)900.7%
 
(Missing)1345298.6%
 
ValueCountFrequency (%) 
1001< 0.1%
 
2501< 0.1%
 
3001< 0.1%
 
3504< 0.1%
 
4006< 0.1%
 
ValueCountFrequency (%) 
67001< 0.1%
 
50001< 0.1%
 
47001< 0.1%
 
44002< 0.1%
 
42001< 0.1%
 

unit
Categorical

HIGH CARDINALITY
MISSING

Distinct count1726
Unique (%)38.1%
Missing9113
Missing (%)66.8%
Memory size106.6 KiB
406
 
39
306
 
28
509
 
26
304
 
25
302
 
25
Other values (1721)
4388
ValueCountFrequency (%) 
406390.3%
 
306280.2%
 
509260.2%
 
304250.2%
 
302250.2%
 
301230.2%
 
321210.2%
 
207210.2%
 
501210.2%
 
311210.2%
 
Other values (1716)428131.4%
 
(Missing)911366.8%
 

Length

Max length20
Median length3
Mean length3.250219877
Min length1

leaseLength
Categorical

MISSING

Distinct count4
Unique (%)2.4%
Missing13480
Missing (%)98.8%
Memory size106.6 KiB
12
145
6
 
9
1
 
7
3
 
3
ValueCountFrequency (%) 
121451.1%
 
690.1%
 
170.1%
 
33< 0.1%
 
(Missing)1348098.8%
 

Length

Max length4
Median length3
Mean length3.010627382
Min length3

name
Categorical

HIGH CARDINALITY
MISSING

Distinct count4476
Unique (%)33.6%
Missing335
Missing (%)2.5%
Memory size106.6 KiB
Studio
 
405
A4
 
102
1 Bedroom
 
101
1x1
 
99
Standard Studio
 
95
Other values (4471)
12507
ValueCountFrequency (%) 
Studio4053.0%
 
A41020.7%
 
1 Bedroom1010.7%
 
1x1990.7%
 
Standard Studio950.7%
 
A1950.7%
 
A2920.7%
 
2x2910.7%
 
A3700.5%
 
S1670.5%
 
Other values (4466)1209288.6%
 
(Missing)3352.5%
 

Length

Max length50
Median length8
Mean length9.647903841
Min length1

new
Categorical

MISSING

Distinct count1
Unique (%)0.2%
Missing13121
Missing (%)96.2%
Memory size106.6 KiB
New
523
ValueCountFrequency (%) 
New5233.8%
 
(Missing)1312196.2%
 

Length

Max length3
Median length3
Mean length3
Min length3

applyNow
Categorical

MISSING

Distinct count2
Unique (%)0.6%
Missing13309
Missing (%)97.5%
Memory size106.6 KiB
Request To Apply
323
Apply Now
 
12
ValueCountFrequency (%) 
Request To Apply3232.4%
 
Apply Now120.1%
 
(Missing)1330997.5%
 

Length

Max length16
Median length3
Mean length3.313031369
Min length3

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

Unnamed: 0_xpropertyNamestreetcitystatezipCodeneighborhoodreviewScorereviewCountwalkScoretransitScoreUnnamed: 0_ybedRoomTypebedRoomNumberbathsrentsqftavailabilitydepositunitleaseLengthnamenewapplyNow
00Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090700Studio11.01790.0465.0TrueNaNNaNNaNS.2.WNaNNaN
10Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090701Studio11.01905.0475.0TrueNaNNaNNaNS.1.ENaNNaN
20Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090702Studio11.01920.0570.0TrueNaNNaNNaNS.3.SNaNNaN
30Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090703Traditional11.02060.0509.0TrueNaNNaNNaN0.18.WNaNNaN
40Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090704Traditional11.02070.0657.0TrueNaNNaNNaN0.17.WNaNNaN
50Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090705Traditional11.02075.0596.0TrueNaNNaNNaN0.4.ENaNNaN
60Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090706Traditional11.02075.0597.0TrueNaNNaNNaN0.1.2NaNNaN
70Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090707Traditional11.02075.0598.0TrueNaNNaNNaN0.6.WNaNNaN
80Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090708Traditional11.02075.0620.0TrueNaNNaNNaN0.9.WNaNNaN
90Jackson Apartments2401 S Jackson StSeattleWA98144Atlantic0090709Traditional11.02075.0629.0TrueNaNNaNNaN0.20.SNaNNaN

Last rows

Unnamed: 0_xpropertyNamestreetcitystatezipCodeneighborhoodreviewScorereviewCountwalkScoretransitScoreUnnamed: 0_ybedRoomTypebedRoomNumberbathsrentsqftavailabilitydepositunitleaseLengthnamenewapplyNow
136347263 br, 1 bath House - 8533 18th Ave NW8533 18th Ave NWSeattleWA98117Crown Hill00855410506Traditional31.03600.01550.0TrueNaNNaNNaNNaNNaNNaN
136357271320 14th Ave S1320 14th Ave SSeattleWA98144Seattle00748910507Traditional21.01695.0750.0TrueNaNNaNNaNNaNNaNRequest To Apply
1363672910306 Holman Rd N10306 Holman Rd NSeattleWA98133Greenwood00855710517Studio11.01100.0400.0TrueNaNNaNNaNNaNNaNRequest To Apply
13637730311 6th Ave S311 6th Ave SSeattleWA98104Chinatown009910010518Studio11.01195.0386.0TrueNaNNaNNaNNaNNaNRequest To Apply
136387311307 N Northgate Way1307 N Northgate WaySeattleWA98133North College Park00876110519Studio11.01295.0288.0TrueNaNNaNNaNNaNNaNRequest To Apply
136397323631 13th Ave W3631 13th Ave WSeattleWA98119Queen Anne00695610520Traditional21.01600.0900.0TrueNaNNaNNaNNaNNaNRequest To Apply
13640733500 W Mercer St500 W Mercer StSeattleWA98119Lower Queen Anne00846310521Traditional21.02400.01180.0TrueNaNNaNNaNNaNNaNRequest To Apply
136417356 br, 3 bath House - 12705 27th Ave NE12705 27th Ave NESeattleWA98125Lake City00886010632Traditional63.03150.02400.0TrueNaNNaNNaNNaNNaNNaN
136427363230 SW Avalon Way3230 SW Avalon WaySeattleWA98126Fairmount Park00785810633Traditional11.01395.0605.0TrueNaNNaNNaNNaNNaNRequest To Apply
136437377719 Renton Ave S7719 Renton Ave SSeattleWA98118Brighton00766410634Traditional21.01725.0780.0TrueNaNNaNNaNNaNNaNNaN